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ITERATED SMOOTHED BOOTSTRAP CONFIDENCE INTERVALS 
FOR POPULATION QUANTILES 

By Yvonne H. S. Ho and Stephen M. S. Lee 1 

The University of Hong Kong 

This paper investigates the effects of smoothed bootstrap itera- 
tions on coverage probabilities of smoothed bootstrap and bootstrap-t 
confidence intervals for population quantiles, and establishes the opti- 
mal kernel bandwidths at various stages of the smoothing procedures. 
The conventional smoothed bootstrap and bootstrap-t methods have 
been known to yield one-sided coverage errors of orders 0(n _1//2 ) and 
o(n~ 2// ' ! ), respectively, for intervals based on the sample quantile of 
a random sample of size n. We sharpen the latter result to 0(n _5//6 ) 
with proper choices of bandwidths at the bootstrapping and Stu- 
dentization steps. We show further that calibration of the nominal 
coverage level by means of the iterated bootstrap succeeds in reduc- 
ing the coverage error of the smoothed bootstrap percentile interval 
to the order 0(n~ 2 ^ 3 ) and that of the smoothed bootstrap-t interval 
to 0(n _58,/57 ), provided that bandwidths are selected of appropriate 
orders. Simulation results confirm our asymptotic findings, suggest- 
ing that the iterated smoothed bootstrap-t method yields the most 
accurate coverage. On the other hand, the iterated smoothed boot- 
strap percentile method interval has the advantage of being shorter 
and more stable than the bootstrap-t intervals. 

1. Introduction. It is generally known that under Bhattacharya and 
Ghosh's (1978) smooth function model, the bootstrap percentile method 
confidence interval is subject to a one-sided coverage error of order 0(n -1 / 2 ), 
rendering it indistinguishable from the classical normal approximation method. 
Hall (1986) shows that Studentization can be employed to reduce this error 
to 0(n _1 ). The iterated bootstrap offers an alternative to error correction by 
calibrating the nominal coverage level iteratively; see Beran (1987). Hall and 
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Martin (1988) show that each such iteration reduces the one-sided coverage 
error by an order of 0(ra~ 1//2 ) successively. On the other hand, smoothing 
the bootstrap, which amounts to drawing bootstrap samples from a kernel- 
smoothed empirical distribution, instead of sampling with replacement from 
the raw data, does not affect the coverage accuracy of bootstrap intervals in 
general. Polansky and Schucany (1997) propose smoothed bootstrap strate- 
gies to yield intervals of 0(n~ 1 ) coverage error. Their methods, however, 
require sophisticated tuning of the smoothing bandwidths, rendering the 
improvement less stable than that resulting from Studentization or the iter- 
ated bootstrap. 

In contrast to problems in the context of smooth function models, con- 
ventional bootstrap confidence intervals for the gth population quantile, for 
a fixed q € (0, 1), have notably poor coverages; see Hall and Martin (1989). 
Here the percentile method gives coverage error of order 0(n -1 / 2 ), which 
cannot be improved upon by nominal coverage calibration using the iterated 
bootstrap. Indeed, more generally, this 0{n~ l l 2 ) coverage error is inherent 
in any confidence interval procedure based directly on order statistics as a 
consequence of their binomial-type discreteness. See, for example, De An- 
gelis, Hall and Young (1993) for a more detailed account of the above phe- 
nomenon. On the other hand, either the smoothed bootstrap or Studentiza- 
tion extends considerably the domain from which we derive the confidence 
limits, and may therefore be able to make asymptotic improvement over the 
conventional bootstrap percentile method. 

In the context of estimating the variance cr 2 of the sample gth quan- 
tile, Hall, DiCiccio and Romano (1989) show that sufficiently high-order 
smoothing of the bootstrap succeeds in reducing the relative error of the 
unsmoothed bootstrap estimate from 0(n _1//4 ) to 0{n~ l / 2+e ) for arbitrar- 
ily small e > 0. Falk and Janas (1992) show that smoothing the bootstrap 
percentile method returns the same order, 0(n~ 1 / 2 ), of coverage error as in 
the unsmoothed case. Studentization of the sample quantile involves estima- 
tion of cr 2 , which admits an expansion n~ 1 q(l — q) f '(i 7-1 (q))~ 2 + 0(n -3 / 2 ) 
under proper regularity conditions, where / = F' and F denotes the distri- 
bution function underlying the given random sample. In practice cr 2 has to 
be estimated from the sample by, for example, bootstrapping or explicit es- 
timation of the leading term in its expansion above. Hall and Martin (1991) 
show that confidence intervals based on normal approximation of the sam- 
ple quantile Studentized by the unsmoothed bootstrap variance estimate 
yield coverage error of order 0(n -1 / 2 ). Falk and Janas (1992) show that a 
similar result holds when the variance is estimated by plugging in the ker- 
nel density estimate of /. However, application of the smoothed bootstrap 
to the Studentized sample quantile in the latter case, which we shall term 
the smoothed bootstrap-t method, succeeds in improving the error order to 
o(n -2 / 3 ), if second-order kernels are used in conjunction with appropriately 
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selected bandwidths at both the Studentization and bootstrapping steps; see 
Janas (1993). This result will be sharpened in Section 2.4, where we prove 
that the smoothed bootstrap^ method can indeed yield a coverage error of 
precise order 0(n -5 / 6 ). 

We investigate further in this paper the asymptotic effects on coverage 
error of iterating the smoothed bootstrap or bootstrap^ methods, with the 
objective of generating confidence intervals with improved coverage accu- 
racy. De Angelis, Hall and Young (1993) remark in passing without proof 
that iterating the smoothed bootstrap percentile method might reduce cov- 
erage error. They mention neither the degree of improvement nor the choices 
of kernel bandwidths, and implementation of the iterated smoothed boot- 
strap remains impractical. Iterating the smoothed bootstrap-i has not been 
explored in the literature. We formalize the theory of the iterated smoothed 
bootstrap and bootstrap-i methods by stating explicitly the orders of their 
coverage errors, and derive therefrom the optimal sizes of kernel bandwidths 
at the two levels of smoothed bootstrapping and also at the Studentization 
step where applicable. Specifically, we show for the smoothed bootstrap 
method that calibration of the nominal coverage level by one extra level 
of smoothed bootstrapping can reduce the coverage error from 0(n~ 1 / 2 ) 
to 0(?i~ 2 / 3 ), provided that the bandwidths are chosen of order ranging 
from n -2 / 9 to n -1 / 12 at the outer level of bootstrapping, and of order n -1 / 3 
at the inner level. For the smoothed bootstrap-i method, such coverage cali- 
bration succeeds in reducing the coverage error from 0(n~ 5 / 6 ) to 0(n~ 58 / 57 ), 
provided that the bandwidths are chosen of order n -2 / 19 at the outer level, 
of order n -11 / 57 a t the inner level and of order ranging from n ~ n / 19 to n -1 / 2 
at the Studentization step. The latter result signifies thus by far the best cov- 
erage accuracy achievable by bootstrap confidence intervals proposed in the 
literature for the population quantile. It also outperforms Beran and Hall's 
(1993) interpolated confidence interval, which is based on a convex combina- 
tion of sample quantiles and yields a coverage error of order 0(n _1 ). Chen 
and Hall's (1993) smoothed empirical likelihood interval, which is based on 
smoothing the standard empirical likelihood procedure, has two-sided cover- 
age error of order 0(n _1 ). They show also that Bartlett correction reduces 
the error further to 0(n~ 2 ) and a simple approximation to the correction 
results in error slightly smaller than 0(n~ 1 ). However, the one-sided cov- 
erage error of the smoothed empirical likelihood interval, Bartlett-corrected 
or not, remains of order 0(n -1 / 2 ). For, as we can see from (6.10) and (6.14) 
of Chen and Hall (1993), the one-sided coverage expansion for the interval 
consists of a nonvanishing term of order n -1 / 2 . This term persists even after 
Bartlett correction, which affects only terms of order 0(n _1 ). Our iterated 
smoothed bootstrap and bootstrap-i intervals thus compare favorably with 
the smoothed empirical likelihood approach in terms of one-sided coverage 
accuracy. Moreover, our approaches have the further advantage of being 
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extendable by additional bootstrap iterations to yield successively more ac- 
curate coverages, subject only to availability of computer resources and the 
extent to which asymptotic implications can be realized in practice. 

Simulation results suggest that our two iterated bootstrap methods yield 
coverages much more accurate than their noniterated counterparts. The im- 
provement is more significant in the non-Studentized case. In general our 
methods have accuracies comparable to the interpolation or smoothed em- 
pirical likelihood methods, but improving at a faster rate as n increases. 
Despite its impressive coverage accuracy, the iterated smoothed bootstrap-i 
method suffers, as expected, from the problem of instability pertinent to 
variance estimation for a sample quantile, which often results in overly long 
and highly variable confidence intervals. On the other hand, the iterated 
smoothed bootstrap percentile approach, albeit slightly less accurate, is rel- 
atively much more stable than both the noniterated and iterated smoothed 
bootstrap^ methods. 

Success of the iterated bootstrap in the present context extends its scope 
of application beyond the traditional regular problem settings and beyond 
the conventional, unsmoothed bootstrap procedures, yielding asymptotic im- 
provement of a problem-specific nature. This confirms the potential of the 
iterated bootstrap as a general strategy for improving upon the bootstrap 
not just in cases where the conventional bootstrap works satisfactorily, such 
as in smooth function model settings, but also in cases where it does not 
work as satisfactorily, such as in the quantile case, and where a modified 
form of the bootstrap, such as the smoothed bootstrap, is required. 

Section 2.1 introduces notation and states the regularity conditions re- 
quired for the asymptotic theory. Sections 2.2 and 2.3 establish asymptotic 
expansions for the coverage probabilities of the noniterated and iterated 
smoothed bootstrap percentile method intervals, respectively, while their 
Studentized counterparts are treated in Sections 2.4 and 2.5. Based on our 
asymptotic results, we derive for each type of interval the optimal orders 
of kernel bandwidths at each level of bootstrapping and, where applicable, 
at the Studentization step in order to minimize coverage error. Section 2.6 
discusses an alternative approach, which bases the confidence set root on 
a smoothed version of the sample quantile, to constructing bootstrap con- 
fidence intervals. Section 3 addresses the issue of empirical determination 
of bandwidths and suggests a bootstrap solution to the problem. Section 4 
reports two simulation studies. The first demonstrates the bootstrap pro- 
cedure for setting optimal bandwidths. The second shows that the iterated 
bootstrap improves upon the smoothed bootstrap and bootstrap-t methods 
in terms of coverage accuracy. The iterated smoothed bootstrap interval 
also excels in terms of other indicators of interval performance. Section 5 
concludes our findings. All technical details are given in the Appendix. 
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2. Theory. 

2.1. Notation and assumptions. Let X = {X\, . . . , X n } be independent 
and identically distributed from an unknown distribution F with density 
f = F' . The parameter of interest is the qth population quantile F~ 1 (q) = 
inf{x € R :F(x) > q}, for a fixed q £ (0, 1). We wish to construct a nominal 
level 1 — a upper confidence interval for F~~ 1 (q), where < a < 1. 

Let F n be the empirical distribution of X, which assigns a probability 
mass of n" 1 to each X$, i = 1, and let F n „ be its kernel-smoothed 
version with density f n ,ri-> such that 

n n 

F ntV (t)=n~ 1 Y / K((t-X i )/r ] ) and / n> „(i) = (m?)" 1 £ fe((t - 
i=l i=i 

for a kernel function A;, -KT(i) = dx and a bandwidth t/ > 0. See Sil- 

verman (1986) for a general exposition of kernel density estimation. Note 
that the unsmoothed and smoothed sample qth quantiles are given, respec- 
tively, by F~ 1 (q) and F'^q). 

Write q = min(g, 1 — q). Let $ be the standard normal distribution func- 
tion and let </> = $' be its density. Define, for e € (0, q), D±, Di > and j = 
1,2, ... , J-j(e,Di,D2) to be the class of distribution functions F satisfying 
the following smoothness conditions: (i) F -1 is j times continuously differen- 
tiable in (q-e,q+e), (ii) (F" 1 )'^) > D 1 and (hi) max^i,...^ sup pe(g „ e 9+£ ) < L> 2 - 

It is clear that Tj + i(e, Di, D2) C JFj(e, D±, D2) for j = 1,2, We shall 

establish coverage expansions for our iterated and noniterated bootstrap 
intervals under F € J-2(s, D±, D2) for the non-Studentized case and F € 
.7-4 (e,-D 1,1)2) for the Studentized case. 

We make the following assumptions on the kernel k throughout the pa- 
per: 

(Al) k is nonnegative, symmetric about zero and has compact support 

[—a, a], for some a > 0; 
(A2) few) exists and is bounded on [—a, a] for j = 1,2,3,4; 
(A3) JZ k(x)dx = f~ 00 x*k(x)dx = l. 

Note that the above assumptions require that A: be a proper density function, 
symmetric about 0, on the interval [—a, a]. 

2.2. Smoothed bootstrap percentile method. Let X^ = {-X"}, . . . , X^} be a 
random sample simulated from F n „. We may set in practice xj = Y* +r)W* , 
i = l,...,n, where the Y* and the W* are independent random numbers 
drawn from the distributions F n and K, respectively. Denote by F* rj the 
empirical distribution of X<. Define 

G n (t) = ¥{n 1/2 {F-\q) - F~ l (q)) <t}, t € R. 
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The smoothed bootstrap version of G n (t) substitutes F n v for F, and is given 
by 

G n , v {t)=nn 1 '\F*-\q)-F-] n {q))<t\X}. 

Following Beran's (1987) prepivoting idea, the root n 1 ^ 2 (F~ 1 (q) — F~ 1 (q)) 
can be transformed, by prepivoting with G n „, into a random variable ap- 
proximately uniformly distributed over [0, 1] . It is clear in the context of con- 
fidence interval construction that the above action of prepivoting amounts 
to smoothed bootstrap estimation of the quantile of n 1 ^ 2 (F~ 1 (q) — F~ 1 (q)). 
This defines a noniterated smoothed bootstrap percentile upper confidence 
interval, of nominal level 1 — a, to be 

Write for brevity a q = {q{l — q)} 1 ^ 2 - The following theorem establishes an 
asymptotic expansion for the distribution of the prepivoted root, and hence 
the coverage probability of Ii j(X . 

Theorem 1. Under conditions (Al)-(A3) and assuming that r\ cx n~ Av 
with 1/4 < A„ < 1/2, we have that, for a € (0, 1), 

nGnAn l/2 [F-\q)-F-\q)])>a} 
= F{F~ 1 (q)eh, a } 

+ C ,( r? 2 +n -2 r? -4 + n -l/2 7? l/2 ) 

uniformly in F € J~2( £ j Di,D2), for any e € (0, q) and D\,Di > 0. 

We see from Theorem 1 that I\^ a has coverage error of precise order 
0(n~ 1 / 2 ), provided that F G J r 2{e,D\,D2) and the bandwidth r/ cx n~ A? > is 
chosen such that 1/4 < < 3/8. Falk and Janas (1992) obtain an expansion 
similar to (1) for the coverage probability, up to order o(ra -1 / 2 ), under the 
restrictive condition that rj = o(ra -1 / 3 ). The expansion (1) in our Theorem 1 
gives an error term that reveals the explicit influence of the bandwidth rj 
on the coverage, which is crucial to our subsequent study of the effects of 
bootstrap iterations. 

2.3. Iterating the smoothed bootstrap percentile method. In standard sit- 
uations, the iterated bootstrap has been known to be very effective in en- 
hancing coverage accuracy of confidence intervals. It operates by calibrating 
either the nominal level or the interval end points, with the use of an addi- 
tional level of bootstrapping. We shall focus only on the former approach, 
which conforms exactly to Beran's (1987) prepivoting idea. 
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Define G nr) as in Section 2.2 based on a generic smoothed bootstrap 
sample X J drawn from F n , v . Let X* = {X£,...,X*} be a generic outer- 
level smoothed bootstrap sample drawn from F n p, for a kernel bandwidth 
j3 > 0, and let F* g be its empirical distribution. Define, for r] > 0, H n>v by 

H n ,r)(t) = n~ l Y^i = \K{{t — X*)/rj), which corresponds to a smoothed em- 
pirical distribution of X* . Similarly we denote by X** a generic inner-level 
smoothed bootstrap sample drawn from H n>r) , and by H* the (unsmoothed) 
empirical distribution of X** . Define 

= nn^iK^iq) - H-fa)) < t\X, X*}, t 6 R. 

Then the smoothed bootstrap estimates the distribution function of the 
prepivoted root G nyTI {n 1 / 2 [F~ l {q) — F~ l (q)\) by the conditional distribu- 
tion, J nAv say, of G^ v (nV\F*~p(q) - F~^(q)]) given X. Prepivoting with 

Jn,/3,r) leads to a twice-prepivoted root J ni( g i .^(G' n)r? (n 1 / 2 [F J ^ 1 (g) — F~ 1 (q)])), 
which is asymptotically uniformly distributed over [0,1]. In the context of 
confidence interval construction, this amounts to estimation of the ath quan- 
tile of n 1 ^ 2 (F~ 1 (q) —F~ 1 (q)) by G~^ ] (J~'p „(«))■ The corresponding iterated 
smoothed bootstrap upper confidence interval, of nominal level 1 — a, is 

I 2>a = (-oo,F-\q) -n^G-^J-lja))}. 

Note that in the above procedure we have allowed use of two different band- 
widths, (3 and rj, for the two levels of smoothed bootstrapping. This proves 
to be crucial to achieving asymptotic improvement in terms of coverage ac- 
curacy by means of the iterated bootstrap. The following theorem states 
in asymptotic terms how close the twice-prepivoted root is to a uniform 
random variable, and hence establishes the coverage error of l2,a- 

Theorem 2. Assume the conditions in Theorem 1, that F £ J~2{£, D\, D2) 
for some e E {0,q) and D\,Di > 0, and that (3 oc n - ^ with < A^ < 1/3. 
Then we have 

= P{F- 1 (q)eI 2 , a } 

= l- a + ( V 2 + n" V 4 + n- l 'W 2 + n- 1 ' 2 l3 2 + n" 1 /?" 372 )- 

We see from Theorem 2 that the two levels of smoothed bootstrapping 
contribute separately to the coverage error of /2 a! which can be minimized to 
achieve 0(n -2 / 3 ) by setting rj oc n - 1 / 3 and /3 oc n p with 1/12 < A^ < 2/9. 
The iterated smoothed bootstrap method thus improves upon the nonit- 
erated I\^ a . We note, however, that application of the iterated smoothed 
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bootstrap in the quantile problem does not yield the same level of improve- 
ment as has been well known in smooth function model situations, where 
each iteration of the (unsmoothed) bootstrap reduces coverage error by an 
order of 0(n _1//2 ). 

2.4. Smoothed bootstrap-t method. We review in this section the smoothed 
bootstrap-t method and derive explicitly the optimal orders of bandwidths 
that minimize its coverage error. Janas (1993) establishes that the smoothed 
bootstrap-t method yields coverage error of order o(n~ 2 / 3 ). Our results 
sharpen those of Janas (1993) by giving the precise order, namely 0(n _5//6 ), 
of the minimum coverage error. Noting that 

ol = VarCF- 1 ^)) = n~\(l - q) f (F -1 (q))~ 2 + 0(n~ 3 / 2 ), 

we may estimate ncr 2 , on plugging in a kernel density estimate for /, by 

U=l 

for some bandwidth £ > and kernel function h, which is assumed to satisfy: 

(Bl) h is nonnegative, symmetric about zero and has a compact support 

[—6, b], for some b > 0; 
(B2) for some decomposition — b = xq < x\ < ■ ■ ■ < x m = b, h! exists on each 

interval (xj-x,Xj), is bounded and is either strictly positive or strictly 

negative there; 
(B3) JZ h(x)dx = JZ x 2 h(x)dx = l. 

Recall that = {x\, . . . ,X^} denotes a random sample from F n „. Then 
the smoothed bootstrap version of s 2 - is given by 

f = q(l - <?)«) 2 h{{F* n -\q) - X\)/A ■ 

Define, for t € K, 

K n> s(t) = nn ll2 {F~\q) - F~\q))/s ( < t} 

and 

K n ^(t) = V{n x /\F*-\q) - < W- 

Then the smoothed bootstrap-t upper confidence interval of nominal level 
1 — a is 

I 3>a = (-00,^(9) - n-^K-^a)]. 
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Janas (1993) shows that if both h and k are chosen to be second-order, 
(3 oc n -1 / 3 and r/ oc ra -1 / 5 , the coverage error of I^ >a achieves an order of 
o(n~ 2 / 3 ). We shall show that the minimum order of this coverage error is 
in fact 0(n -5 / 6 ), provided that the orders of bandwidths 77, £ are chosen 
properly. 

Similar to Theorem 1, the following theorem establishes an asymptotic 
expansion for the distribution of the prepivoted root K ntri ^{n 1 ^ 2 [{F~ 1 {q) — 
F~ 1 (q))/s^\), and hence the coverage probability of I% y a- 

Theorem 3. Assume conditions (A1)-(A3), (B1)-(B3), and that 77 oc 
ra~ A " and £ oc n~ A s , with < < 1/5 < A ? < 1. Then, for a G (0, 1), 

nK n ^ 1/2 \(F^( q ) - F~ l {q))/§(\) > a} 
= P{F- 1 (g)€/ 3 , a } 

= 1 - a + (nriy l D 1A (F) + {n^rfD^F) 

+ rT^D^F) + n~ x D A>2 (F) + n^^D^F) 
+ 0(K)" 5/2 + < 5 + e /2 + n-^ 2 v 4 

+ + ^-3/2^-5/2 + n -3/2 r l/2 r? -l 

+ «)~V + n^C V V + (nv)'^ 1/2 

+ n- x f 2 i x 'W + rT 3 / 2 r V /2 + n" 1 / 2 ^!?- 5 / 2 + «)- 3/ V) 

(2) 

uniformly in F € ^(e, Z?i, D2), for any e € (0, <?) and D\,D2> 0. -f/ere, /or 
eac/i j = 1, . . . , 5, j(.F) denotes a smooth function of the density deriva- 
tives {f(F~ 1 (q)),f'(F~ 1 (q)), f^(F~ 1 (q))}. 

We see from Theorem 3 that I^, a has coverage error of order 0(n _5//6 ), 
provided that A, ; = 1/6 and 3/8 < A^ < 1/2. This suggests that I^^ a is 
asymptotically more advantageous than both I\ i0l and /2,a- Note the dif- 
ferent choices of bandwidth orders here as compared to Janas' (1993) rec- 
ommendation, which yields only an o(ra~ 2 / 3 ) coverage error for Iz )a . 

It may be possible to further reduce the coverage error of either the iter- 
ated smoothed bootstrap or the smoothed bootstrap-i method if a higher- 
order kernel function k is employed. In fact, Janas (1993) shows that the 
error of 1%^ can be made as small as 0(n~ 1+e ), for any e > 0, based on ker- 
nels of sufficiently high order. Similar results hold for the iterated smoothed 
bootstrap, suggesting that both I 2a and 1%^ are essentially indistinguish- 
able in terms of asymptotic coverage accuracy when high-order kernels are 
used. Our discussion nevertheless focuses on the practically more impor- 
tant second-order kernels, which have the virtue of being nonnegative and 
therefore allow straightforward Monte Carlo simulation from the resulting 
smoothed empirical distributions. 

Studentization by s^ requires no direct simulation from the kernel h. We 
may therefore relax the second-order condition on h without inflicting extra 
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computational difficulty. Hall, DiCiccio and Romano (1989) show that use 
of a higher-order kernel h can actually speed up the convergence rate of s^. 
However, we see from (2) that the best achievable coverage error of is 
determined critically by the (n??) -1 and n _1 / 2 r/ 2 terms. Increasing the order 
of h affects only terms involving its bandwidth £, and can therefore not 
reduce the coverage error further. 

It would be intriguing to explore the possibility of iterating the smoothed 
bootstrap-t method to produce even more accurate confidence intervals for 
quantiles. We address this issue in the next section. 

2.5. Iterating the smoothed bootstrap-t method. We follow the definitions 
used in Section 2.3 for bootstrap samples X\ X* and X**. The iterated 
smoothed bootstrap version of s 2 is given by 

s^ = q (l- q )(n0 2 I^H(H*-\ q )-Xn/0^ ■ 

Define 

Kl^{t)=T{n ll2 {H* n -\ q ) -H-^{q))/sf <t\X,X*}, t € R. 

Similar to the construction of the iterated smoothed bootstrap interval l2,a, 
we set L ni/ 3 >r?i g to be the conditional distribution of K* ^(n 1 ' 2 [F*~^ (q) — 

Fn}(i)}/ S l) e iven where 

Sf =q(l-q)(n0 2 ^h((F^\q)-X*)/^ ■ 

The required twice-prepivoted root is given by 

L nM {k n ^{n l l 2 [F-\q) - F~ 1 {q)]/ s^j)^ 

and the ath quantile of n l l 2 (F~ l {q) — F~ 1 (q))/s^ is estimated by ^(L^ ^( 

where K n ^ v £ and are defined as in Section 2.4. The corresponding iterated 
smoothed bootstrap-t upper confidence interval, of nominal level 1 — a, is 

J 4 , a = (-oo,i^H?)-n" 1/2 ^J, e (^i, IJ> f(«))]- 

Note that construction of involves three different bandwidths: £ at the 
Studentization step, (5 at the outer level and 77 at the inner level of smoothed 
bootstrapping. The following theorem establishes the order of the coverage 
error of in terms of the three bandwidths. 
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Table 1 

Optimal bandwidth orders, on the — log n scale, and corresponding 
coverage errors of Ij, a , for j — 1, 2, 3, 4 

— log n (bandwidth) 







l2,a 


l3,a 


l4,a 


Outer-level 




[1/12,2/9] 




2/19 


Inner- level 


(1/4,3/8] 


1/3 


1/6 


11/57 


Studentization step 






[3/8,1/2] 


[1/2,11/19] 


Coverage error 


Oin- 1 / 2 ) 


0(n" 2 / 3 ) 


0(n" 5/6 ) 


0(n" 58 / 57 ) 



Theorem 4. Assume the conditions of Theorem 3, that F € J-^s, D±, D2) 
for some e E (0, q) and D\,D2 > 0, and that f3 oc n~^P with < A^ < 1/7. 
Then 

= P{F- 1 (( 7 )e/ 4 , Q } 

= 1 - a + Odinri)- 1 + n'^C 1 }^ 2 + n" 1 / 2 /?" 3 / 2 ) 
+ [«)~ 1 ? ? 2 + n- 1 ](/? 2 + n- 1/2 /3- 5/2 ) 

[6) + (^)-5/2 +nf 5 + ^/2 + n -l/ 27? 4 

+ n -l^l/2 + n-3/2^-5/2 + ^-3/2^-1/2^-t 

+ «)- 1 ^ 4 +n- l r 1/ V 

+ (nr/)" Y /2 + n" 1 / 2 ^/ 2 ?/ 2 + n" 3 ^" V /2 
+ n- 1 ^V 5/2 + K)- 3/2 r/ 2 ). 

We see from Theorem 4 that the iterated smoothed bootstrap-i inter- 
val l^a can achieve an o(n~ 1 ) coverage error. For instance, setting A^ = 
11/57, 1/2 < A^ < 11/19 and A^ = 2/19 guarantees a coverage error of or- 
der 0(n" 58 / 57 ). The precise minimum order of coverage error of a can 
be derived from a more detailed but uninspiring asymptotic expansion than 
that displayed in (3), which we omit here for simplicity. 

We remark that the above iterated smoothed bootstrap-t construction 
gives the fastest convergence rate of coverage as compared to other, smoothed 
or unsmoothed, bootstrap constructions thus far proposed in the literature. 
Not even the use of a high-order kernel k, which typically yields a cover- 
age error of order 0(n~ 1+e ) for an arbitrarily small e > and a sufficiently 
high kernel order, is able to surpass this result. Successive iterations of the 
smoothed bootstrap procedure reduce the coverage error of further. The 
forbidding task of managing a large number of bandwidths in a single asymp- 
totic expansion prevents us from exploring this option further, although we 
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recognize its potential in producing asymptotic improvement. The interpo- 
lation method proposed by Beran and Hall (1993) gives a confidence interval 
of 0(n _1 ) coverage error, which cannot be improved upon by higher-order 
interpolations. 

We see from (3) that the 0(n" 58 / 57 ) coverage error of I4 a is determined 
strictly by terms of orders n~ 1 r/~ 1 (3 2 , n _1 r/ 2 /?~ 7 / 2 and re" 3 / 2 //" 5 / 2 . Similar 
to the case of smoothed bootstrap-/; construction, the coverage error of 
cannot be further reduced by increasing the order of the kernel function h 
used for Studentization, which affects only terms involving £. 

Table 1 above summarizes the optimal choices of bandwidth orders and 
the corresponding one-sided coverage errors for trie four intervals /i q, 1% oli 
h,a and I^ a . 

2.6. Smoothing the sample quantile: an alternative. Define 

n rt 
/n,c(*) = K)" 1 E <^ ~ X i)IO and F nA (t) = / / n ,c(x) dx, 
i=i J -°° 

for a kernel function k and a bandwidth £ > 0. A smoothed version of 
the sample quantile F~ 1 (q) may be defined by F~^(q), which we term the 
smoothed sample qth quantile. We now examine the effects on coverage er- 
ror of basing the bootstrap confidence intervals on the root n}' 2 {F~^(q) — 

F~ 1 (q)) instead of n 1 / 2 (F~ 1 (g) - F~ 1 (q)). Define 

G njf (t) = P{n 1 / 2 (F- C 1 (g) - F~\q)) <t}, t € R, 

and G nr) f to be the smoothed bootstrap version of G n ^ with F substituted 
by F n>ri . The corresponding smoothed bootstrap percentile upper confidence 
interval, of nominal level 1 — a, is 

I£ a = (-oo, F-^q) - n-V 2 G-^ c (a)). 

The following theorem establishes an asymptotic expansion for the coverage 
probability of Jf a . 

Theorem 5. Suppose that F G ^(e, D\, D2) for some e G (0,q) and 
D\,D2 > 0, and that k is a second-order nonnegative kernel function. Under 
conditions (A1)-(A3) and assuming that rj oc n _A,f with 1/4 < A„ < 1/3 and 
Q oc n~ A ? with > 3/8, we have, for a G (0, 1), that 

(4) FiF-^q) G I^J = 1 - a + n~ x l 2 E K + o^" 1 / 2 ) + 0{n 3 / 2 ( 4 + n^C 2 ), 
for some nontrivial constant E K independent o/n, £. 
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We see from Theorem 5 that If a has coverage error of precise order n" 1 / 2 
provided that > 1/2. In any case, the order of the coverage error can- 
not be reduced further by adjusting the bandwidth £. The best achievable 
coverage errors of both Ii )Q and If a are of order 0(n _1//2 ), so that basing 
the smoothed bootstrap interval on the smoothed sample quantile does not 
yield any asymptotic improvement upon that derived from the sample quan- 
tile. We conjecture that arguments similar to those proving Theorem 5 can 
be employed to show that the smoothed sample quantile approach has no 
advantage either in the Studentized case. 

3. Empirical determination of bandwidths. We now turn to the problem 
of empirical determination of the optimal bandwidths in practice. Many 
different practical strategies have been proposed for bandwidth selection 
in other problem settings, which often permit natural adaptation to our 
present context. Plausible approaches include, for example, cross-validation, 
an extra level of bootstrapping and plugging-in of sample quantities into 
asymptotic expansions of optimal bandwidths, among others. 

Despite its computational intensity, the bootstrap approach is arguably 
the most straightforward method for setting optimal bandwidths. A smoothed 
bootstrap procedure for setting bandwidths is as follows. First, we gener- 
ate an outermost level of smoothed bootstrap samples from F n ^, for some 
bandwidth 7 > 0. The collection of such samples, denoted generically by X° , 
forms the basis for our estimation of coverage probabilities and hence the 
determination of the best bandwidths. With reference to the optimal orders 
of bandwidths provided in Table 1, we set up a grid of pilot values of band- 
widths for use in our procedure. For example, when considering l2, a , we may 
select /3 r 's to be evenly spaced points within the range 
and 's to be evenly spaced points within the range [Cn" 1 / 3 \DrT x /% for 
some A,B > and D > C > 0. The outermost bandwidth 7 can be fixed 
to be some multiple, M say with M > 1, of the largest pilot bandwidth 
attempted at the outer level of the smoothed bootstrap. In our example 
we can set 7 = MBn~ l l 12 . This is in line with our perception that the 
parent sample is drawn from an underlying distribution smoother than the 
smoothed empirical distribution used for bootstrapping. For each combi- 
nation (/3, 77) = ((3 r ,r) s ) and each sample X° , we construct ^2,0 and check 
if F~}:(q) € Ii, a - The coverage probability of for each bandwidth pair 
((3 r ,i] s ), is estimated by averaging over all samples X° . The required boot- 
strap confidence interval li i0l is then constructed using the pair of band- 
widths (P r ,r] s ) that gives the minimum coverage error as estimated above. 
We note that this procedure explicitly ensures that the bandwidths selected 
have the optimal asymptotic orders as displayed in Table 1. Selection of 
bandwidths for construction of the other three bootstrap intervals can be 
dealt with in a similar way. 
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Table 2 

Estimated coverage probabilities of Ij. a , j = 1,3, with band-widths 
determined by smoothed bootstrap, for q = 0.5 and a = 0.05, 0.10, 0.90, 0.95 

Interval 1 - a = 0.05 1 - a = 0.10 1 - a = 0.90 1 - a = 0.95 



Standard normal data, N(0, 1) 





0.057 0.137 0.957 


0.955 


h,a 


0.051 0.079 0.943 


0.968 




Double-exponential data, |exp(— |a:|) 




h,a 


0.035 0.101 0.974 


0.948 


h,a 


0.058 0.112 0.904 


0.955 




Lognormal data, exp{iV(0, 1)} 




Il,a 


0.053 0.104 0.966 


0.972 




0.071 0.094 0.809 


0.893 



4. Simulation studies. Two simulation studies were conducted to inves- 
tigate the finite-sample performances of our proposed intervals. The first 
study concentrated on intervals constructed using empirically determined 
bandwidths and computed their resulting coverage probabilities. The sec- 
ond study investigated the effects of the iterated smoothed bootstrap on 
coverage probabilities of both one- and two-sided confidence intervals, with 
bandwidths fixed in advance. In both studies, we chose a = 0.05, 0.1, 0.9 and 
0.95, and simulated 1000 random samples of size n from each of three un- 
derlying distributions — the standard normal distribution, the double expo- 
nential distribution of unit rate and the standard lognormal distribution — in 
order to estimate the coverage probabilities. The kernels h, k were all taken 
to be the triangular density function x i— > 1 — \x\, defined on [—1, 1]. 

In the first study, we set q = 0.5, n = 15 and computed the coverage prob- 
abilities of I\ )Cl and Is ta constructed using bandwidths determined by the 
smoothed bootstrap procedure suggested in Section 3. We attempted empiri- 
cally a wide range of values of M and found that the choice M = 1.5 yielded 
reasonable results under most combinations of F and a. We set M = 1.5 
henceforth. Each interval was constructed using 500 smoothed bootstrap 
samples, and its coverage probability estimated from 500 outermost boot- 
strap samples X° . For Ji )Q! , the bandwidth r] for final adoption was searched 
from 20 evenly spaced values between 0.2?i~ 3 / 8 and 2n~ 1//4 . For 1%^, the 
pilot bandwidths (£,77) were selected from the 20 x 20 grid of evenly spaced 
values over the rectangle [0.2n~ 1/2 , 2n~ 3 / 8 ] x [0.2n~ 1 ^, 2n~ 1 / 6 ]. Table 2 re- 
ports the coverage probabilities of the two intervals, which agree in general 
with our theoretical result that the Studentized 1$^ is more accurate than 
the non-Studentized An exception is found for the lognormal data, 
for which variance estimation is unstable, especially for small samples, and 
renders 1%^ less accurate. 
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In the second study all four intervals /j )Q , i = 1, ... ,4, were constructed 
for the qth population quantiles, for q = 0.5 and 0.75. We also included for 
reference Beran and Hall's (1993) interpolated interval, denoted by /bh,ch 
and Chen and Hall's (1993) smoothed empirical likelihood intervals, denoted 
by /el, a if the interval is not Bartlett-corrected and by Iel(b),o if it is- Three 
sample sizes, n = 15, 30 and 100, were considered. Each smoothed bootstrap 
or bootstrap-i interval was constructed using 1000 bootstrap samples. Each 
iterated interval was constructed using 1500 outer-level and, for each outer- 
level sample, 1000 inner-level bootstrap samples. For each bootstrap-t inter- 
val, we estimated the Studentizing variance by its asymptotic expansion as 
provided in Sections 2.4 and 2.5, thus avoiding the need for one more level of 
bootstrapping. Throughout the study, all bandwidths were fixed to be their 
asymptotically optimal orders for 

convenience: r/ = n -1 / 3 for Ii )Q ; 77) = (ra -1 / 5 , re^ 1 / 3 ) for ha'i (VjO = 
(ra- 1 / 6 ,™- 1 / 2 ) for J 3;Q ; and (A?7,£) = (n" 2 / 19 , n" 11 / 57 , n" 1 / 2 ) for I A , a . For 
each empirical likelihood interval, we fixed the bandwidth at n -1 / 2 , by which 
Chen and Hall (1993) have produced reasonable results. Tables 3 and 4 
summarize the coverage figures for q = 0.5 and 0.75, respectively, for the 
cases a = 0.05 and 0.95. Coverage probabilities of two-sided intervals of 
nominal level 0.9, constructed as I?o$ = /.,o.05 \ I-,o.95, are a ^ so reported. 
Similar findings were obtained for a = 0.1 and 0.9, and are therefore omitted 
from this paper. In the case of two-sided intervals, we estimated also the 
means and variances of the interval lengths. 

We see from Tables 3 and 4 that /2,a is much more accurate than I\ a in 
most of the cases although the latter is slightly shorter and less variable. This 
confirms the finite-sample gain acquired by iterating the smoothed bootstrap 
percentile method. Similar observations are found for the bootstrap-t cases, 
where the effect of iteration is more notable at the upper end point. However, 
the degree of improvement of the iterated over the noniterated I^, a is 
less remarkable than that achieved by iterating the percentile method, which 
is not surprising given the very satisfactory coverage already registered by 
/3 iQ . The coverage figures also demonstrate that /2,» competes closely with 
/3 jQ in terms of coverage accuracy. In general has coverage probabili- 
ties comparable to those of the interpolated intervals /bh,»- Despite their 
asymptotically inferior one-sided coverage accuracy, the empirical likelihood 
intervals Iel,o and I-el(b),ol are found to be slightly more accurate than the 
bootstrap intervals. Nevertheless the accuracy of the latter improves at a 
faster rate as n increases compared to that of the empirical likelihood in- 
tervals. There is no clear winner in any case, especially for small samples. 
The two-sided non-Studentized intervals i" 2 0.9 an d -^20.9 are usually shorter 
and more stable compared to the Studentized intervals /fo.9 and ^40.9- Note 
lastly that if^.g is in general more accurate than ^el(_b)0 9' although both 
intervals have coverage errors of orders slightly smaller than n . 
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Table 3 

Estimated coverage probabilities of Ij ia for 1 — a= 0.05 ("lower") and 0.95 ("upper"), 
and of the 90% two-sided interval ( "overall") , for j = 1, 2, 3, 4, BH, EL, EL(B) 



Interval lower upper overall lower upper overall lower upper overall 









n 


= 15 






n = 30 






n 


= 100 
















Standard normal data, N(0, 1) 














Il,a 





.096 





,894 


0.798 


0.089 


0.901 


0.812 





,082 





.913 





,831 


l2,a 





.067 





,938 


0.871 


0.064 


0.941 


0.877 





,059 





.952 


0. 


,893 







.057 





932 


0.875 


0.054 


0.935 


0.881 





,058 





.935 


0. 


,877 


Ii,a 





.049 





,942 


0.893 


0.037 


0.927 


0.890 





,051 





.940 


0. 


,889 




o 


.046 


o 


,952 


0.906 


0.051 


0.950 


0.899 


o 


046 


o 


.950 


0. 


,904 


-Tel, a 





.061 





939 


0.878 


0.049 


0.943 


0.894 





,049 





.944 


0. 


,895 


^EL(B),a 





.058 





,942 


0.884 


0.049 


0.945 


0.896 





,047 





.945 


0. 


,898 












Double- 


exponential data, ^ 


exp(-|o;|) 














Il,a 





.065 





934 


0.890 


0.065 


0.928 


0.872 





,055 





.953 


0. 


904 


l2,a 





.046 





,954 


0.908 


0.050 


0.950 


0.887 





,044 





.964 


0. 


,914 


l3,a 





.044 





,955 


0.908 


0.049 


0.951 


0.899 





,040 





.952 


0. 


916 


Ii,a 





.042 





943 


0.901 


0.040 


0.946 


0.906 





,042 





.947 





,905 







.031 





,957 


0.926 


0.059 


0.951 


0.892 





,046 





.957 


0. 


,911 


-TeL, a 





.041 





947 


0.906 


0.062 


0.943 


0.881 





,053 





.958 


0. 


,905 


-fEL(B),a 





.039 





951 


0.912 


0.060 


0.945 


0.885 





,053 





.958 


0. 


,905 












Lognormal data, 


exp{jV(0,l)} 














II, a 





.066 





,801 


0.820 


0.059 


0.853 


0.863 





066 





.876 





,851 


Il,a 





.059 





,873 


0.825 


0.052 


0.900 


0.868 





,057 





.933 


0. 


,860 


l3,a 





.046 





,888 


0.865 


0.045 


0.918 


0.881 





,055 





.927 





,871 







.044 





926 


0.882 


0.036 


0.934 


0.898 





,049 





.941 


0. 


,892 







.047 





,956 


0.909 


0.051 


0.950 


0.899 





,046 





.951 


0. 


,905 


^EL, a 





.062 





,940 


0.878 


0.052 


0.945 


0.893 





,053 





.945 





,892 


^EL(B),a 





.060 





943 


0.883 


0.051 


0.946 


0.895 





053 





.945 





,892 



Sample quantile of interest is F 1 (0.5). 



5. Conclusion. We have examined the asymptotic effects of iterating the 
smoothed bootstrap on confidence intervals for quantiles, and established the 
optimal bandwidth orders which minimize the coverage error. Our construc- 
tion combines two well-known techniques for modifying the conventional 
bootstrap, smoothing and iteration, in a bootstrap procedure to produce 
very accurate confidence intervals. Through iterating the smoothed boot- 
strap, the percentile and bootstrap-t methods yield coverage errors of orders 
0(?i~ 2 / 3 ) and 0(n -58 / 57 ), respectively. The latter indeed surpasses all boot- 
strap methods proposed thus far in the literature as well as Beran and Hall's 
(1993) interpolated interval and Chen and Hall's (1993) smoothed empiri- 
cal likelihood intervals with or without Bartlett-correction. The asymptotic 
gain acquired by iterating the bootstrap in the present context is somewhat 
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Table 4 

Estimated coverage probabilities of Ij ia for 1 — a= 0.05 ("lower") and 0.95 ("upper"), 
and of the 90% two-sided interval ("overall"), for j = 1, 2, 3, 4, BH, EL, EL(B) 



Interval lower upper overall lower upper overall lower upper overall 



n = 15 n = 30 n = 100 



Standard normal data, N(0, 1) 



r, 


n 
u 


.UO 1 


n 
u 


8^8 


0.771 


0.085 


0.888 


0.803 


n 
u 


081 


n 
u 


yuu 


n 
u 


89^ 







.052 





,923 


0.871 


0.058 


0.941 


0.883 





,056 





,947 


0. 


,891 







.044 





,885 


0.841 


0.061 


0.926 


0.865 





,062 





,949 


0. 


,887 


Ii,a 





.042 





,905 


0.863 


0.043 


0.930 


0.887 





,058 





962 


0. 


,904 


Ibh,c 





.039 





,979 


0.940 


0.053 


0.950 


0.897 





,046 





,944 


0. 


,898 


-Tel, a 





.050 





,925 


0.875 


0.052 


0.941 


0.889 





,053 





932 


0. 


,878 


^EL(B),a 





.046 





932 


0.886 


0.052 


0.944 


0.892 





,053 





936 


0. 


,883 












Double- 


exponential data, ^ 


exp(-|x|) 














Il,a 





.089 





,862 


0.773 


0.079 


0.848 


0.769 





,068 


0. 


,883 


0. 


,815 


l2,a 





.062 





914 


0.852 


0.048 


0.911 


0.863 





,049 





,926 





,877 


l3,a 





.044 





,890 


0.846 


0.065 


0.927 


0.862 





,060 





,938 





,878 


l4,a 





.049 





912 


0.863 


0.056 


0.923 


0.867 





,065 





,940 


0. 


,875 


Ibh,c 





.035 





,974 


0.939 


0.057 


0.961 


0.904 





,049 





,944 





,895 


-TeL, a 





.044 





933 


0.889 


0.061 


0.950 


0.889 





,053 





,928 


0. 


,875 


4L(B),a 





.044 





939 


0.895 


0.057 


0.952 


0.895 





,052 





931 


0. 


,879 












Lognormal data, 


exp{AT(0,l)} 














A, a 





.091 





746 


0.655 


0.069 


0.798 


0.729 





,058 


0. 


,851 





793 


Il,a 





.045 





,847 


0.802 


0.052 


0.906 


0.854 





,052 





,937 





,885 


l3,a 





.049 





,844 


0.795 


0.042 


0.899 


0.857 





053 





,928 


0. 


,875 







.050 





,883 


0.833 


0.048 


0.901 


0.853 





053 





,954 


0. 


901 







.039 





,979 


0.940 


0.053 


0.950 


0.897 





,046 





,945 


0. 


,899 


^EL, a 





.051 





,918 


0.867 


0.053 


0.944 


0.891 





,051 





933 





,882 


^BL(fl),a 





.044 





,922 


0.878 


0.053 


0.949 


0.896 





051 





934 





,883 



Sample quantile of interest is F 1 (0.75). 

nonstandard in the sense that the reduction in coverage error is of an order 
smaller than 0(n~ 1 / 2 ) as is commonly the case in regular settings. Table 1 
gives a summary of the nonstandard asymptotic improvement effected by 
smoothed bootstrap iteration. 

We have also discussed the effects of using smoothed sample quantiles 
instead of sample quantiles in the construction of bootstrap intervals or 
using higher-order instead of second-order kernels for Studentization in the 
construction of bootstrap- 1 intervals. Both approaches are shown to yield no 
asymptotic gain. 

Empirical findings of our simulation study agree broadly with the asymp- 
totic theory. Bootstrap-t intervals are in general more accurate than per- 
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centile method intervals; and iterated intervals are more accurate than non- 
iterated intervals. On the other hand, the percentile method intervals do 
not require variance estimation for the sample quantile and hence possess 
the extra advantage of being stable and short compared to the bootstrap-i 
intervals of the same nominal level. 

Bootstrap iteration requires an additional level of bootstrapping, resulting 
in a computationally more intensive procedure. The apparent computational 
cost of simulating two batches of outer-level bootstrap samples, the X^s and 
the X*'s, can be alleviated as follows. First simulate one single batch of sam- 
ples (y*,W*)% where y* = (Y{,...,Y*) and W* = (W?, . . . , W*) denote 
independent random samples from F n and K, respectively. Combine y* and 
W* using the appropriate bandwidths to form the smoothed bootstrap sam- 
pies *t = (Y{ + r,Wt, ...,Y* + V W*) and X* = (Y x * +pWf,..., Y* + (3W*). 
Studentization does not pose a computational problem due to the availabil- 
ity of an explicit asymptotic formula for the variance of a sample quantile, 
which can be readily estimated in practice. 

Optimal orders of kernel bandwidths in our construction are specific to 
the types of intervals being considered. In general, the outer-level smoothed 
bootstrapping step requires a bandwidth wider than the inner level. For the 
bootstrap-t method, the bandwidth used for Studentization should be nar- 
rower than the bandwidths required by both levels of bootstrapping. The 
iterated bootstrap typically imposes stricter conditions on our choices of 
bandwidths. It would be interesting to compare the optimal orders of band- 
widths in the quantile problem with those typically recommended for more 
conventional problems. For instance, density estimation requires a band- 
width of the familiar order n -1 / 5 . Under smooth function model settings, 
asymptotic improvement over the unsmoothed bootstrap can only be ef- 
fected by a bandwidth of the order n _1//4 for the one-sided smoothed boot- 
strap percentile method interval, and of the order n~ l l 2 for either the two- 
sided smoothed bootstrap percentile method or bootstrap-i intervals; see 
Polansky and Schucany (1997). 

While we acknowledge the importance of the issue of bandwidth selection 
and have suggested a bootstrap approach to empirically setting optimal 
bandwidths, our empirical experience derived from the second simulation 
study suggests that significant improvement in terms of coverage accuracy 
can still be acquired, even for small samples, by our simplistic specification 
of the bandwidths to an arbitrary multiple, which we set as 1 in the above 
study, of their optimal orders. 

APPENDIX 

Proof of Theorem 1. We follow the proof of Theorem 3.1 in Falk and 
Janas (1992). Define the distribution function of the standardized sample 
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qth quantile by 

A nJ (t) = PV/VCF- 1 ^))^- 1 ^) - F-^q)}/^ <t}, t G R. 

Under the assumed continuity property of F, Reiss [(1989), page 119] estab- 
lished an Edgeworth expansion for A n j as 



(A.l) 



A nJ (t) = $(t) + n 



+ 0(re~ 



-1/2 



q 2(2q 



i: 



+ 



2g 



3cr, 



3(Tg 

1 + <r q f{F-\q)) 



V 2 {F-\q)) 



<P(t) 



where 5 n = 1 + nq — \nq~\ and \x\ denotes the smallest integer greater than 
or equal to x. Close examination of the proof of (A.l) shows that the expan- 
sion is actually valid uniformly for F £ J r 2{e,D\,D2) for any e € (0,q) and 
Z>i,I?2 > 0, a result analogous to Theorem 2.1 of Janas (1993). Such uniform 
validity carries over to all the expansions which follow, and we presume this 
fact without mention. We write A n for A n j if / is the uniform density func- 
tion over the interval [0, 1]. To start with, we require the following bounds 
for the distances between different versions of sample quantiles, distribution 
functions and density derivatives: 



(A.2) 
(A.3) 
(A.4) 
(A.5) 



F-J(g) - F~\q) = O p ( v 2 + n-Vy/2 + n -i; 



F- 1 (q)-F-\q) 

m _ f(d) 

J n,rj J 



O p (ri 



0p(r? 2 + n -i/ 2r? -d-i/2^ 



for d = 0,1. 



For details of the above bounds, see Falk and Janas (1992) for (A.2) and 
(A.3), Zhou (1997) for (A.4) and Jones (1994) for (A.5). 

Now we outline the key steps of the proof. Under the condition that rj tx 
n~^ v with 1/4 < A„ < 1/2, straightforward Taylor expansion in conjunction 
with the bounds (A.2)-(A.5) gives that 



(A.6) 



nGn,r l {n ll2 [Fn 1 {q)-F-\q)])<x} 

\k,(Fn l (q))(F~\q)-F~\q)) 



n 



+ -f'{F-\q)){F-\q) - F~\q)) 2 < A~ 1 (x) 



-t-oy+n^/y/ 2 ; 



Conditioning on the event that (re 1 / 2 / 'o q )(F n 1 (q) — F 1 (q))f(F 1 (q)) = u, 
followed by integrating the conditional probability over u € R with respect 
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to the distribution of the standardized sample quantile, the probability in 
(A. 6) equals 

nfn, V {Un)u/f{F~\q)) 

(A ' 7) + ^- 1/2 fn, v (F~HQ)Wa q /f(F~\ q )) 2 

<K l {x)\F n \q)=u n }K nJ {du), 

where u n = F~ l {q) + un^ 1 / 2 ^/ f(F~ 1 (q)). Reiss [(1989), page 119] argues 
that conditional on F~ 1 (q) = u n , X can be treated as a collection of n in- 
dependent random variables Y%, . . . ,Y n , where {Y±, . . . ,Yr nq ii\ constitutes 
a random sample from the right truncated density f (x) / F (u n )l{x < u n }, 
Y\nq] = u n and {yj- n(? "|+i, • • • , Y n } is a random sample from the left trun- 
cated density f(x)/(l — F(u n ))l{x > u n }, with 1{-} denoting the indica- 
tor function. It follows that the conditional probability in (A. 7) equals the 
unconditional probability that X^=i^ri,«,t < A" 1 (2;), f° r a sum °f indepen- 
dent random variables T n u s = T n u 

,i(Yi)- L et Hn,u and u n u be the mean 
and variance of Z)^=i^n,«,t; respectively. We find by the delta method that 
Mn,u = u{l + R n (u)} and a UjU = \u\a n (u), where 

and 

a n (u) = (nrr 1/2 (jf^fi) ^ + 0(n^W /2 + n^V 5 / 2 ). 

Standardizing Y%=i T n,u,i to S n , u = (E?=i r n,-M ~ fJ>n,u) / <Tn,u and decompos- 
ing the characteristic function of S ntU into factors contributed, respectively, 

by the partial sums Ej-=i r n,u,j, r n,«,rn?l and Ey=f^]+i we can 

derive an Edgeworth expansion for the distribution of S UiU using standard 
arguments and rewrite (A. 7) as 

< <& n (u) + Q n (u))A n>f (du) 

mi 

i=l 

(A.8) +Y,iPn,u^\K{u))@n{uy/j\ \Kj{du) 

+ 0(r ? 2 + n-V 4 + n" 1/ V /2 ) 
{$(*?„(«)) + C»(«)}A^/(du) 

+ 0(r? 2 + n~ 2 r/~ 4 + n~ 1/2 r? 1/2 ) say, 
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where = (A" 1 ^) - u) / (\u\a n (u)) , & n (u) = -sign(u) R n (u)/a n (u), 

V (z) - (nvr if2{ ^n(u)f(F-H q ))Ik(vfdv W Z ^ - 1\ 
p n , u[ z)-{n V ) | (/(F-i( g ))/fc( w )2 du )3/2 J ^ 6 r {Z) 

+ O p {n-^l% 

and the are chosen such that (f? -2 + r?rf + n 1 / 2 ^ -1 ' 2 )0 n (tt) mi+1 — > 0, 
for i = l,2. 

Note that C, n {u) is bounded by terms of the form |i9 n (u)p : |© n (it)^(i? n (u)), 
for positive integers j, For any fixed it G (— oo,2y] U [2y/3, oo) and any 
fixed y < 0, |i9 n (it)| J : |0 n (u)|^(?9 n (tt)) has order 0(n~ x ), for any fixed A > 
and sufficiently large n. The same applies to $(i? n (it)) if u > 2y/3, and to 
1 — <&($ n (tt)) if it < 2y. Assuming without loss of generality A~ x (x) < and 
writing y = A~ 1 (x), it follows from the above bounds and (A. 8) that the 
integral (A. 7) equals 

/■2j//3 ry 

/ $(# n (u))K,f(du) + / - l)A n)/ (du) 

Jy J2y 

(A.9) +/ Cn{u)Kj{du)+Kj{y) + 0{rf + n- 2 r ] - 4 + n- 1 l 2 r ] l ' 2 ) 

J2y 

= I n + II n + III n + K nJ (y) 

+ 0(if +n~V 4 + ™~ 1/ V /2 ) say. 

It follows from term- by-term integration with the aid of (A.l) that 
I n = -II n + 0((n V y 1 ) 

= (2.)^/ 2 (nr ? )-V2^iM^y /2 | y | A ; /(y ) +0 ((n ? r 1/2 ) 



(A.10) 



and 



(All) n V 2a ? + 2/(F~^)) 2 r W 



+ 0( r? 2 +n -2 r? -4 +n -l/2 r? l/2 ) 

Inverting A n and substituting into (A.l), we get 

(A.12) K nJ (y) = x + n- 1 ^ ( ^/^i^a ) (^W^*" 1 ^)) + O^ 1 ). 

The expansion (1) then follows by combining (A. 6), (A.9)-(A.12), putting 
x = a and taking the complement of the probability. □ 



Proof of Theorem 2. Arguing as in Janas (1993), we see that the 
conditions on k and imply that ¥(F n ^ 6 T^ie, D\, D2)) = 1 — o(?i~ A ) for 
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any A > 0. It follows that, on substitution of F n a for F in (1), the iterated 
version of the probability has the expansion 

(A.!3, - * - + 

+ O p (7? 2 + n-V 4 + n- 1/ V /2 )- 
It follows from the bounds (A.2)-(A.5) that J~p^{a) differs from the ath 
quantile of the prepivoted root G n ^(n l / 2 [F~ l (q) — F~ 1 (q)]) by an order of 
O p (rj 2 + n~ 2 ??~ 4 + n~ 1/2 r] 1/2 + n~ 1/2 /3 2 + n" 1 /?" 3 / 2 ) . Theorem 2 then follows 
by the delta method. □ 

PROOF of Theorem 3. We denote in the sequel by Ci ii (F),C 2l i(F), . . . 
generic smooth functions of density derivatives {f(F~ 1 (q)),f'(F~ 1 (q)), . . . , f^(F~ 1 (q))}, 
for each i = 0, 1, In cases where Cj^(F) assumes the form of a polyno- 
mial in a variable x, we write Cj t i(F) = Cj,i{F){x). As in (A. 7), we define 
u n = F~ l (q) + un~ l / 2 a q / f(F~ l (q)) and write 

(A.14) K n4 (x) = f nL^Un)u/f(F-\q)) < x\F-\q) = u n }K nJ {du). 



Arguing as in the proof of Theorem 1, we show that the conditional distri- 
bution of the standardized form of f n ^{u n )'U'/f(F~ 1 (q)), given F~ 1 {q) = u n , 
has an Edgeworth expansion 

<S>(y) + Hr 1/2 C lfl (F)(y) 
(A ^ +(nO- 1 C 2 , (F)(y)+n~ 1 / 2 e /2 C 3fi (F)(y) 

1 ' ; +nr 3/2 C4, (F)(y) + n- 1 C 1/2 c 5 , 1 (F)(y) 

+ O p (n~ l + n- 3 / 2 r 1 + «)- 2 + n- l ' 2 e 12 ). 

Plugging (A. 15) into (A.14) and splitting the integral as in (A. 9), we see in 
the present context that 

/ n + // n = «)- 1 C 6 ,o(F)(x) 
(A lfil + «r 3/2 C 7 , (F)(x) + «r 2 C 8 , (F)(x) 

1 J +n- 1 C 9 , (F)(x)+n~ 3 / 2 C 1 C w ,i(.F)(x) 

+0 (n- i e /2 +n- 2 c 3/2 +«r 5/2 ) 

and 

III n = (nO~ 1 C llfi (F)(x)+n- 1 / 2 C 12 ,i(F)(x) 

+ ( n O~ 3 ^ 2 Ci3,o(F)(x) + n~ 1 ^~ 1 / 2 Ci4 i i(F)(x) 
+ n^ 2 e /2 C 15A (F)(x) + n- l C^ 2 {F){x) 
(A.17) +n-V 2 C 1 C 17 ,i(F)(x)+fC 18t2 (F)(x) 

+ n- 1 /2 e c 19 , 1 (F)(x) + (n()- 2 C 20 , (F)(x) 

+ o(n- 3 / 2 c 1/2 + Kr 5/2 + n~ 1/2 e 3/2 

+ n - 2 r 3/2 + n^e 12 + e 12 + n^e 912 ). 
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It follows by noting (A.l) and combining (A. 16) and (A. 17) that 

K n ^{x) = <f>(x) + (nO~ 1 C 2 i,o{F)(x) 

+ n^ 2 C 22A (F)(x) + «)- 3/2 C 23 ,o(F)(x) 

(A.18) +n~ 1 / 2 e /2 C 25 , 1 (F)(x) +n~ 3 / 2 C 1 C 26 , 1 (F)(x) 

+ (nCr 2 C 27fi (F)(x) + n- 1 C 28 , 2 (F)(x) + fC 29 , 2 (F)(x) 
+ n- 1 ' 2 tC m . l {F){x) 

+ 0(n~ 2 C 3/2 + K)" 5/2 + n 1/2 £ 9/2 + e' 2 + n 3 / 2 e 9/2 )- 

Note that (A.18) extends the expansion given in Theorem 2.1 of Janas 
(1993) by including higher-order terms. Arguments similar to those pro- 
vided by Janas (1993) can be used to show that (A.18) holds uniformly in 
F e F 3 (e,D 1 ,D 2 ). The assumptions (A1)-(A3) and that A„ G (0,1/5) imply 
that F ntTI £ Fz(e,D\,D 2 ) with negligible probability. An asymptotic expan- 
sion similar to (A.18) thus holds for K n ^{x), with F substituted by F n>7] . 
Standard Taylor expansion together with bounds (A.2)-(A.5), with (A. 5) 
strengthened to include cases d = 2,3, yields an expansion for the difference 

K-^(x)-K-^(x) 

= k n ,^(x) + O p ((n£)~ Y + rT x 'W + n" 3 ^" Y /2 + n\ 112 

+ (< )~5/2 + e 5/2 + <5 + n -3/ 2r? -5/2 

+ n- 1 / 2 ev~ 5/2 +n- l ' 2 e ,2 r l 2 + K^Y), 



where 



k n ^{x) = {(nCr 1 C 31fi (F)(x) +n- 1 / 2 C 32 ,i(F)(x) 

+ (nO~ 3/2 C 3 3.o(F)(x) + n- l C l/2 C u ,i{F){x) 
+ n~ l / 2 e /2 C^ l {F){x)} 

x[fn,v(Fn\Q))-f(F-Hq))] 
+ {n- 1 / 2 C 36 , 1 (F)(x)+n- 1 r 1/2 C 37 , 1 (F)(x) 
+ n~ 1 / 2 e /2 C 38A (F)(x)} 

xlfUF-'iqV-f'iF-Hq))} 
+ {K)- 1 r ? 2 C'39,i(^)(^)+n- 1 /VC4o,i(^)(^)}/; ?) (^7 1 (9)) 
+ n- 1 / 2 r l 2 C Ahl (F)(x)fZ v (F- 1 (q)). 
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Note, by conditioning and integrating as in (A. 14), that 

nK n ^ 1/2 [(Fn\q) - F-\q))/h\) > «} 
= jnkd u n)u/f{F-\q)) 

(A.19) ~ Kn, V ,t( a ) ^ K n\( a )\ F n l = U n }A nJ (du) 

+ + (< )-5/2 + ^5/2 + <5 + n -3/ 2r? -5/2 

+ n-^frj- 5 / 2 + rT^i/y + K )-3/2 ??2)> 

Consider first the integral 

K n ,rt,dy) = I HL,d u nh/ f(F~\q)) - k n , v ,c(a) < ylF^Hq) = u n }A nJ {du). 



Proceeding, with lengthy algebra, as in establishing the Edgeworth expan- 
sion for the conditional probability in (A. 7), we see that the conditional dis- 
tributions of the standardized f n ,t(u n )u/ /(i 7-1 ^)) an d fn,((u n )u/ /(F -1 ^)) - 
K n ^(a), given that = u n , have the same Edgeworth expansion up 

toO p («)- 2 + n-V2£3/2 + ^3/2^-1/2^-1 + + ^-l^-S) ft 

follows, by lengthy algebra again, that the extra term k n )J? ^ (a) in (A.19) con- 
tributes only to the ZZ7 n component of the integral (A. 14) through EK njT? ^(a), 
up to order 

0(K) -5/2 +< 5 + £5/2 + n -l/2 7? 4 

+ n -l ?? l/2 + n-3/2^-5/2 + ^3/2^-1/2^-1 

+ «)-y + n- x r V V + (™?)- 1 e 1/2 + rr 1/2 £ 17 y 
+n - 3 / 2 rW /2 +n~ 1/2 fv- 5/2 + «r 3/ V). 

Explicit expansion of the contribution of k n jJ?) ^(a) yields that K n ^^{y) dif- 
fers from K n ^(y) by 

(n7 7 )- 1 C 42! i(F)(y) + «ryC43,2(^)(y) +n- 1 / 2 r / 2 C 44 ,3(i ? )(y) 

+ n- 1 C 45 ,2(i ? )(y) + n- 3 / 2 r 1 C 46 ,i (F)(y) 

up to the above order, uniformly in F £ T±{e % Di, D2). Theorem 3 now fol- 
lows by putting y = K~^(a) and taking the complement. □ 

Proof of Theorem 4. As in the proof of Theorem 2, when < 
< 1/7, we have F n ^ £ ^(e, D\, D2) with negligible probability. Ap- 
plication of the bounds (A.2)-(A.4) and extending (A. 5) to cases d = 2, 3 
establish that the bootstrap quantile L~p g(a) and the true quantile of 

Kn,v£ ( n1 ^ 2 [(Fn 1 (?) - differ in probability by the error term as 

specified in (3). Theorem 4 then follows by the delta method. □ 
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Proof of Theorem 5. Write * n ,c(0 = G n ^(ta q / f(F~ 1 (q))) and ^f niV ^(t) 
GntfjCfaq/ ' fn,T){Fnh(q))) for t 6 R. Standard Edgeworth expansion gives that 

^ n>c (t) = m + {n- 1/2 E 1}1 (F) + CE 2fi (F)+n 1 / 2 eE 3 , 1 (F)} ( f>(t) 
(A - 20) + 0(n" 1 + n 3 / 2 C 4 ) 



uniformly in F € .^(e, -Di, D2) for any e € (0, <?) and D\,D2 > 0, where 

Ejj(F) denotes a smooth function of the density derivatives {f(F~ 1 (q)), . . . , /^(i* 1-1 ((?))} 

for each j = 1,2,3. On substitution of F n ^ for F in (A. 20) and using (A.2)- 

(A.5), we have 

+ O^n" 1 + n 3 / 2 C 4 + V 2 C + n^V^C + n^ifC? + ?T 3 / 2 C 2 ). 
We obtain by the delta method that the coverage probability of If a equals 

1 J + Oin- 1 + n 3 / 2 C 4 + r/ 2 C + n"V /2 C + ™ V VC 2 + ?T 3/2 C 2 ), 

where A = <7g[l// n ,„(F-J( 9 )) - l//^" 1 (<?))] . Replacing F n ,„ by F n>c in 
(A. 2), using (A. 3), (A. 5) and noting the bounds assumed on 77, £, we have 
that the joint cumulants of (n l / 2 (F~^(q) — F~ 1 (q)),A) differ from those 

of (n 1 / 2 (F~ 1 (q) — F~ 1 (q)),A) by at most 0(n -25 / 48 ). An expansion anal- 
ogous to (A. 21) holds for the interval ii jQ , with the definition of A un- 
changed. Comparison with (1) then implies that the joint cumulants of 
(n 1 ^ 2 (F~ 1 (q) — F~ 1 (q)),A) contribute a term of precise order n -1 / 2 to the 
coverage error of Ii <a . It follows that the Edgeworth expansions for the dis- 
tributions of n 1 /2( j p-J( g ) _ F-^q)) and n 1 / 2 ^"^) - F' 1 ^)) - (a)A 

also differ by an order of n~ 1 / 2 precisely, so that, by (A. 21), the coverage 
probability of If a has the expansion stated in (4). □ 
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